Sparse Multi-pitch and Panning Estimation of Stereophonic Signals
نویسندگان
چکیده
In this paper, we propose a novel multi-pitch estimator for stereophonic mixtures, allowing for pitch estimation on multichannel audio even if the amplitude and delay panning parameters are unknown. The presented method does not require prior knowledge of the number of sources present in the mixture, nor on the number of harmonics in each source. The estimator is formulated using a sparse signal framework, and an efficient implementation using the ADMM is introduced. Numerical simulations indicate the preferable performance of the proposed method as compared to several commonly used multi-channel single pitch estimators, and a commonly used multi-pitch estimator.
منابع مشابه
Acoustic echo cancellation for stereophonic systems derived from pairwise panning of monophonic speech
An algorithm is introduced that performs stereophonic acoustic echo cancellation (SAEC) for systems using pairwise panning of a single monophonic source to provide the effect of spatialisation. The technique exploits the inherent high correlation between the loudspeaker signals, unlike other general SAEC techniques, which try to utilise any small uncorrelated features in the signals. The algori...
متن کاملAn adaptive penalty multi-pitch estimator with self-regularization
This work treats multi-pitch estimation, and in particular the common misclassification issue wherein the pitch at half the true fundamental frequency, the sub-octave, is chosen instead of the true pitch. Extending on current group LASSO-based methods for pitch estimation, this work introduces an adaptive total variation penalty, which both enforces groupand block sparsity, as well as deals wit...
متن کاملFrequency-Dependent Amplitude Panning for the Stereophonic Image Enhancement of Audio Recorded Using Two Closely Spaced Microphones
In this paper, we propose a new frequency-dependent amplitude panning method for stereophonic image enhancement applied to a sound source recorded using two closely spaced omni-directional microphones. The ability to detect the direction of such a sound source is limited due to weak spatial information, such as the inter-channel time difference (ICTD) and inter-channel level difference (ICLD). ...
متن کاملMulti-pitch estimation exploiting block sparsity
We study the problem of estimating the fundamental frequencies of a signal containing multiple harmonically related sinusoidal components using a novel block sparse signal representation. An efficient algorithm for solving the resulting optimization problem is devised exploiting a novel variable step-size alternating direction method of multipliers (ADMM). The resulting algorithm has guaranteed...
متن کاملSemi-Automatic Mono to Stereo Up-mixing using Sound Source Formation
In this paper, we propose an original method to include spatial panning information when converting monophonic recordings to stereophonic ones. Sound sources are first identified using perceptually motivated clustering of spectral components. Correlations between these individual sources are then identified to build a middle level representation of the analysed sound. This allows the user to de...
متن کامل